Draft version February 5, 2008 

Preprint typeset using I^T^X style cmulatcapj v. 6/22/04 



THE DEMOGRAPHICS OF LONG-PERIOD COMETS 
Paul J. Francis 

Research School of Astronomy and Astrophysics, the Australian National University, Mt STromlo Observatory, Cotter Road, Weston 

ACT 2611, Australia 
Draft version February 5, 2008 

ABSTRACT 

The absolute magnitude and perihelion distributions of long-period comets are derived, using data 
from the Lincoln Near-Earth Asteroid Research (LINEAR) survey. The results are surprising in three 
ways. Firstly, the flux of comets through the inner solar system is much lower than some previous 
estimates. Secondly, the expected rise in comet numbers to larger perihelia is not seen. Thirdly, 
the number of comets per unit absolute magnitude does not significantly rise to fainter magnitudes. 
These results imply that the Oort cloud contains many fewer comets than some previous estimates, 
that small long-period comets collide with the Earth too infrequently to be a plausible source of 
Tunguska-style impacts, and that some physical process must have prevented small icy planetesmals 
from reaching the Oort cloud, or have rendered them unobservable. A tight limit is placed on the 
space density of interstellar comets, but the predicted space density is lower still. The number of 
long-period comets that will be discovered by telescopes such as SkyMapper, Pan-Starrs and LSST is 
predicted, and the optimum observing strategy discussed. 
Subject headings: comets: general — Oort Cloud — solar system: formation 



1. INTRODUCTION 

The Oort cloud <|Oortl Il950fi remains the most mys- 
terious part of our solar system, primarily because 
it cannot be directly observed. Our only observa- 
tional clues to the size, shape, mass and composi- 
tion of the Oort cloud come from observations of long- 
period comets. The demographics of observed long- 
period comets have been the starting poin t of almost 
all attemp ts to model the Oort clo ud (eg. lOortl 119501: 
Weissmahlll99a iWiegert fc Tremainelll999l iDones et alJ 
20041) . 

Until the last ten or so years, the vast majority of 
comets were discovered by systematic eyeball searches, 
using small telescopes (|Huarhesl ISOOIL These surveys 
have been highly effective at identifying large samples of 
comets, and in deriving their orbital parameters. They 
do, however, have three major drawbacks: 

i 

• Unknown selection function: it is very unclear how 
often different parts of the sky are surveyed, and to 
what depth. Surveys are clearly more sensitive to 
comets with bright absolute magnitudes and per- 
ihelia close to the Earth, but the strength of th is 
effect is very hard to estimate ( Everharti ll967albT ) . 

• Limited range of comets observed: eyeball surveys 
find few comets fainter than an absolute magnitude 
of 10 and with perihelia beyond 3AU. 

• Poorly defined photometry: these surveys quote 
the "total brightness" of a comet. Total cometary 
magnitudes are notoriously unreliable. They are 
typically measured by defocussing a standard star 
to the same apparent size as the comet, but this ap- 
parent size is heavily dependent on observing con- 
ditions and observational set-up. 
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Despite these drawbacks, many attempts have been 
made to derive the basic parameters of the long period 
comet population from eyeball-selected historic samples. 
The mos t heroic and influential attempt was that of 
lEverhartl l|1967b() . Everhart carried out an exhaustive 
analysis of the historical circumstances in which comets 
were discovered, over a 127 year period. He developed a 
model for the sensitivity of the human eye, and used it to 
calculate the period over which a given historical comet 
could have been seen. This was then used to estimate 
the completeness of the comet sample: if a given type of 
comet was typically seen early in its visibility window, 
surveys should be complete for this type of comet. If the 
mean time to find a given type of comet is, however, com- 
parable to the length of the estimated visibility window, 
the completeness is probably low. Using this method, 
Everhart estimated that for every comet seen, another 
31 were missed. 

A m ore modest and recent attempt was that of Hughes! 
l)2001|) . He restricted himself to the brightest and near- 
est comets, for which he claimed (on the basis of dis- 
covery trends) historical surveys were highly complete. 
The statistics of these comets were simply extrapolated 
to larger perihelia and fainter absolute magnitudes, with 
no correction for observational incompleteness. As one 
would expect, the flux of long-pe riod comets thr ough the 
inner solar system estimated bvlHughesI (120011) is much 
lower that that estimated bv lEverhartJ(|1967b|) . 

Despite these attempts, several basic questions 
about the demographics of long-period comets re- 
main unresolved. O ne question concerns small comets 
l)Brandt et al.lll99^ : those with nuclear radii less than 
~ 1km (absolute magnitudes H > 10). Extrapolating 
the Everhart data implies t hat there shoul d be a large 
population of such comets. iHughesI l|200lD was unable 
to tell whether his model predicted a large population 
of such comets or not. A second que stion conc e rns the 
number of comets per unit perihelion. lEverhartl l)1967bf) 
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found that this number rises from the sun out to 1AU, 
but was unable to determine w hether it keeps rising at 
larger perihelia. iHughesI 1)200 If) found no significant rise, 
but had large enough error bars to bracket both of Ev- 
erhart's possibilities. 

The observational situation has changed radically in 
the last few years. The advent of large format sen- 
sitive CCDs has allowed automated surveys to sup- 
plant eyeball searches as the main mechanism for find- 
ing new long-period comets 1 . Most long period comets 
are now being found as by-products of various auto- 
mated searches for near-Earth objects, such as the Lin- 
coln Near-Earth A steroid Research (LINEAR) project 
ijStokes et alJ 12000ft . the Catalina Sky S urvey 2 , LO- 
NEOS 3 and NEAT 4 (|Pravdo et al.l fl999^ . Many are 
also found by space-based cor onagraphs as they ap - 
proach very close to the Sun (Bicscc ker et alJ [2002). 
though these are most ly fragments of recently disinte- 
grated larger comets (|Sekanina fc Ch odas 20041). 

In this paper, I attempt to deduce the statistical prop- 
erties of the long-period comet population from one of 
these CCD surveys: the LINEAR survey. This has a 
far better defined selection criterion than any historical 
eyeball survey, and extends to much larger perihelia and 
fainter absolute magnitudes. It thus allows both an in- 
dependent check and an extension of previous estimates 
of the long-period comet population. 

Near Earth asteroid (NEO) surveys are not optimized 
for comet detection. While they find many long-period 
comets, they do not publish their raw data, nor all the 
details one would like of their exact detection algorithms 
and sky coverage. In particular, they do not publish on- 
going photometry of the comets they discover. Nonethe- 
less, enough information is available to make a first pass 
at estimating the true population of long-period comets 
from their data. There have been previous attempts 
to use these surveys to detemine the true populations 
of NEOs (eg. Jedicke et alJ 12003ft and dormant comets 
ijLevison et al, 12002ft . but this paper is the first attempt 
of which I am aware to do this for active comets. 

In the next few years, the situation should further 
improve, with the advent of a new generation of wide- 
field survey telescope s, such as S kyMapper 5 , Pan-Starr s 
l|HodaDP et al] 12004ft and Gaia ijPerrvman et alJ 12001ft . 
These surveys will predominantly find comets much 
fainter and more distant than historical surveys. The 
analysis in this paper allows a first estimate of just how 
many long-period comets these surveys can find, and how 
best to identify them. 

I start off by defining a sample of comets drawn from 
the LINEAR sample, and examining its properties, which 
are very different from those of eyeball samples (§|2J. A 
model of the long-period comet population is then gen- 
erated (§ |3J) based on and extrapolating the historical 
eyeball-selected surveys. A Monte-Carlo simulation of 
this comet population as it would be observed by LIN- 
EAR is then developed (§EJ- The results are compared 
to the observed sample in § [SJ I find that the Hughes 
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model is quite a good fit to the data, but that the Ev- 
erhart model is not. I derive my own best-fit model of 
long-period comet demographics. The consequences of 
this new model are many: I examine them in §0 before 
drawing conclusions in § 

2. THE COMET SAMPLE 

Of the several near-Earth asteroid surveys now under- 
way, the Linc oln Near-Earth As teroid Research (LIN- 
EAR) project ijStokes et al.N200'oj) was most suitable for 
constraining the long-period comet population. This is 
because: 

• They discover more comets than any other single 
survey. 

• They publish sky charts on their web page 6 show- 
ing the area of the sky observed during each luna- 
tion, with the point-source magnitude limit reached 
at each location. 

• Their sky coverage and magnitude limit is rela- 
tively simple and uniform across this period. 

The comet sample was defined as follows: 

1. The comet has an orbital period longer than 200 
years. 

2. The comet reached perihelion between 2000 Jan 1 
and 2002 Dec 31. 

3. The comet was either discovered by LINEAR be- 
tween these dates, or could have been discovered 
by LINEAR between these dates had it not al- 
ready been discovered by someone else, or discov- 
ered prior to 2000 Jan 1. 

The 2000-2002 date range was chosen because 
comet details (from the C atalog of Cometary Orbits, 
IMarsden fe Williami200"3|) and sky -maps (including lim- 
it ing magnitudes) are availab le. 

IMarsden fe Williams! (2003) listed 25 comets as having 
been discovered or co-discovered by LINEAR which met 
our criteria. I needed, however, to add two additional 
sub-samples: 

• Comets discovered prior to 2000, but which reach 
perihelion in the period 2000-2002, and which could 
have been first discovered by LINEAR within this 
period, had they not already been found. 

• Comets discovered in 2000-2002 inclusive by other 
surveys, but which would subsequently have been 
seen by LINEAR during this period. 

Potential membe rs of the two addition a l sub- samples 
were selected from IMarsden fc Williams! (|2003ft . Each 
candidate was checked for its detectability by LINEAR, 
using the ephemerides and predicted magnitudes gener- 
ated by the Minor Planet Center 7 . The predicted po- 
sitions and brightnesses were compared to the maps of 
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LINEAR sky coverage. These maps show only the inte- 
grated coverage per lunation, not the night-by-night or 
hour-by-hour coverage, but most of these comets move 
slowly enough that this shouldn't much matter. 

This process added another 27 comets to our sample. 8 
had been detected by LINEAR during 1999, but reached 
perihelion in 2000 or 2001. Most of the remainder were 
first identified by other near-Earth asteroid surveys, par- 
ticularly the Catalina Sky Survey, LONEOS and NEAT. 

For every comet in our final sample, the original dis- 
covery details (as distributed by the Central Bureau of 
Astronomical Telegrams) were checked. From these, the 
discovery date, discovery magnitude ffdis and discovery 
circumstances were noted. The discovery magnitudes are 
total magnitudes (ml). It is not clear how reliable and 
homogeneous these magnitudes are, but no better source 
of CCD photometry is available. They are based on 
CCD observations by professional astronomers of typi- 
cally barely resolved objects, and so should be good to 
~ 0.5mag. 

Absolute magnitudes H were computed from these dis- 
covery magnitudes i/dis- The standard equation was 
used: 

H dis = H + 51og 10 A + 2.5nlog 10 r (1) 

(eg. iWhippld EP78) , where ffdis is the observed total 
magnitude at discovery and n a power-law parameter- 
ization of the dependence on heliocentric distance. As 
is conventional for solar system work, the absolute mag- 
nitude is defined as the observed magnitude if the ob- 
ject were at a distance of 1 AU f rom b oth the Earth 
and the Sun. Following iWhippiel |l978), the dynami- 
cally new and old comets were treated differently (the 
new ones are much brighter at large heliocentric radii, at 
least on their way in) . A comet is classed as dynamically 
new if its original semi-major axis a is > 10,000 AU, old 
if a < 10,000 AU, and undete rmined if the orbit class 
in IMarsden fc W illiamsl l|2003|) is II or worse. For new 
comets, n = 2.44 was used if they are seen pre-perihelion 
and n = 3.35 if seen afterward. For old comets, the val- 
ues are 5.0 and 3.5 respectively. The canonical value of 
n = 4 is used for comets of undetermined orbit type. This 
is uncertain both because real com ets show a disper sion 
in n, and because the n values in IWhippld l|1978j) are 
based on observations at smaller heliocentric distances. 
It is, however, self-consistent with the analysis used in 
our Monte-Carlo simulations. 
Our sample is listed in Tabled 

2.1. Properties of the Sample 

The LINEAR sample has very different properties from 
historical samples (as typified by the Everhart sample). 
Figure^shows that the LINEAR sample extends around 
4 magnitudes deeper, and to much larger perihelia. The 
overlap is small: only ~ 5 of the LINEAR comets lie 
within the absolute magnitude and perihelion region 
sampled by historical samples. 

Analysis of the discovery telegrams indicates that al- 
most all of the comets in the sample were originally iden- 
tified as moving point sources. They were posted on 
the Near Earth Object (NEO) confirmation page 8 at the 
Minor Planet Center. Follow-up observations then de- 
termined that the sources were spatially extended and 

8 http://cfa- www . harvard . edu/ iau/NEO /ToConf irm . html 



TABLE 1 

The LINEAR Long-Period Comet Sample 



Name 


q (AU) 


H 


l/a a 


Orbit class 


C/1999 


Fl 


5.7869 


7.82 


000038 


1A 


r< /i qqq 


32 


7.1098 


6.39 


0.000019 


1A 


C, /I QQQ 


K5 


3 2558 


9.75 


000024 


1A 


r< /i qqq 


K8 


4.2005 


6.33 


0.000681 


1A 


c, /I QQQ 


L3 


1.9889 


10.21 


01 3741 


IB 


Cj /I QQQ 


N4 


5.5047 


9.99 


0.000068 


1A 


C/1999 


SI 


0.7651 


7.84 


000720 


11 


r< /i qqq 


Tl 


1.1717 


4.36 


000173 


11 


C, /I qqq 


T2 


3.0374 


6.05 


000596 


1A 


r< /i qqq 


T3 


5.3657 


5.12 


0.000231 


IB 


n /I QQQ 


Q4 


4.9153 


7.60 


000037 


1A 


C /I QQQ 


Yl 


3.0912 


9.80 


0.000044 


1A 


c, /9nnn 


Al 


9.7431 


8.13 


000044 


1A 


c, /9nnn 


CT54 


3.1561 


10.71 


000051 


1A 


c, /9nnn 

\s j ^uuu 


HI 


3 6366 


10.29 






n /9nnn 

\-j j ^uuu 


Jl 


2.4371 


12.62 


001 406 


1A 


c, /9nnn 


Ol 


5.9218 


7.03 


000037 


1A 


c /9onn 


OF8 


2.1731 


14.07 


000048 


IB 


C/2000 


SV74 


3.5416 


9.40 


000090 


1A 


n /2000 


TJ5 


3.4852 


9.88 


000358 


1A 


c /2000 


Wl 


0.3212 


10.44 






c, /2nnn 


WM1 


0.5553 


6.81 


-0.000459 


11 




Yl 


7.9747 


9.54 


0.000063 


1A 


C/2000 


Y2 


2.7687 


9.65 


001934 

U i UU -L Jut: 


IB 


C, /2001 


Al 


2.4062 


10.71 


0.005738 


2 A 


O/2001 


A2 


0.7790 


14.22 


000447 


11 


C/2001 


Bl 


2.9280 


11.14 


0.000071 


IB 


Cj /2001 


B2 


5 3065 


5.60 


000187 


IB 


C /2001 


CI 


5.1046 


10.30 


0.000020 


1A 




CI 


8.2356 


7.45 


0.000024 


1A 


C/2001 


HT50 


2.7921 


3.15 


0.000878 


1A 


C /2001 


K3 


3.0601 


9.80 


000072 


IB 


\-j j ^UU X 


K5 


5.1843 


8.10 


00002Q 


1A 




N2 


2.6686 


5.77 


0.000455 


1A 


C/2001 


RX14 


2.0576 


6.06 


000776 


1A 


Cj /2001 


SI 


3.7500 


11.36 


018168 




C/2001 


1 16 


4.4064 


7.42 


000998 


1A 


C/2001 


Wl 


2.3995 


14.04 






C/2001 


XI 


1.6976 


12.54 


002285 


2 A 


C/2002 


B2 


3.8430 


10.12 






C/2002 


B3 


6.0525 


7.92 






C/2002 


C2 


3.2538 


8.88 


0.000393 


IB 


C/2002 


E2 


1.4664 


10.34 


0.000173 


IB 


C/2002 


112 


1.6348 


13.29 


0.004024 


2A 


C/2002 


K2 


5.2378 


7.62 






C/2002 


L9 


7.0316 


5.60 


0.000035 


2A 


C/2002 


04 


0.7762 


13.59 


-0.000772 


2A 


C/2002 


PI 


6.5307 


8.55 


0.002023 


2A 


C/2002 


Q2 


1.3062 


16.09 






C/2002 


Q5 


1.2430 


16.59 


0.000058 


IB 


C/2002 


1.12 


1.2086 


14.63 


0.001075 


IB 



a Where available, this is the reciprocal of the original semi- 
major axis, ie. before plane tary perturbations. Taken from 
IMarsden fc Williamsl {2003) 

b Quality flag for the orbit determination. Original semi- 
major axes only available for classes 1A thru 2B. 



hence comets. 77% were discovered before reaching per- 
ihelion (Fig and 73% were were first detected when 
more than 3AU from the Sun. 

The necessity for follow-up potentially introduces two 
sources of incompleteness into this sample. Firstly, some 
fraction of objects posted on the NEO confirmation page 
are never followed up in enough detail to determine 
whether they are comets or not. Timothy Spahr kindly 
provided records of all objects posted to the NEO con- 
firmation page in 2000-2002. Only 11% of these were 
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Fig. 1. — The location of our comet sample in the absolute magnitude vs perihelion plane, as compared to the Everhart (1967) sample. 
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Fig. 2. — The time interval between the discovery of a comet and its perihelion passage (solid line). Negative values indicate that the 
comet was discovered before passing perihelion. For comparison, the dotted line shows our prediction of this distribution, from the best-fit 
model comet population. 
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not followed- up well enough to determine an orbit: this 
places an upper limit on the fractional incompleteness 
of our sample due to failed follow-up. This is probably 
a conservative upper limit: most of these lost objects 
were most likely either not real to begin with or fast- 
moving objects only visible for a short window of time. 
Secondly, some comets might have been inactive at these 
large heliocentric distances, and hence classified as minor 
planets. The minor planet centre database was checked 
for non-cometary objects on long-period, highly eccentric 
orbits, but only one was found which reached perihelion 
within the period 2000-2002: 2002 RN109. Thus this too 
is not a major source of incompleteness. It also shows 
that most comets down to the LINEAR magnitude limit 
are still active out to 10AU from the Sun. 

Fig |3 shows an intriguing correlation between per- 
ihelion distance and semi-major axis in the LIN- 
EAR sample. Thi s corr elation was first noted by 
iMarsden fc Sekaninal (^)73) at smaller perihelia. They 
suggested that it was a selection effect. Dynamically 
new comets ar e brighter at large heliocentric radii (eg. 
lWhipplel lT978). presumably due to extra outgassing at 
large heliocentric distances from their relatively pristine 
surfaces, due perhaps to CO2 or a water ice phase transi- 
tion. The Whipple data did not extend to distances be- 
yond 4AU from the Sun. If this trend continues to larger 
heliocentric distances, however, it would make dynami- 
cally new comets far brighter than older comets with the 
same absolute magnitude. This could thus, in principle, 
bias the sample heavily towards new comets, and hence 
larger semi-major axes. 

The distribution of perihelion positions is shown in 
Fig El The comets are weakly concentrated at intermedi- 
ate galactic latitudes (Fig 01, consistent with the galac- 
tic tide playing a major pa rt in making them observable 
ijMatese fc Whitmird ll996). The galactic latitude dis- 
tribution is not, however, significantly different from the 
predictions of a best-fit model f§ l5.3[) assuming a random 
distribution, as measured by the Kolmogorov-Smirnov 
(KS) test or the Kuiper statisti c. There are also no si g- 
nificant great-circle alignments (Horner fc Evansl [2002). 

3. MODEL OF THE LONG PERIOD COMET POPULATION 

The LINEAR comet sample was compared against a 
Monte-Carlo simulation of the long period comet distri- 
bution. In this section I discuss the parameters used in 
this simulation. 

3.1. Orbital Parameters 

The lEverhartl l)1967b(l and iHughesI l)200H) studies are 
based on comets with a limited range of perihelion dis- 
tances q and hence give only weak constraints on this 
distribution. Everhart found a factor of two rise in the 
number of comets per unit perihelion between and 1 
AU, but beyond that the data are consistent either with 
a continuing rise or a flat distribution. Hughes found no 
significant trend in number of comets against perihelion, 
but his data are quite consistent with such a trend. 

On theoretical grounds, however, a gentle rise in the 
number of comets as a function of perihelion is ex- 
pected, as comets diffuse into the solar syst em past the 
barrier of giant planet pe rturbations (eg. iTsuiil Il992t 
iWieeert fe TremainellT9 99). As a first guess, I chose to 
model the perihelion distribution as an unbroken straight 



line: 



where A = 1 gives a rea sonable fit to the | Eve rhartl 
l|1967b|) distribution. The IWieeert fe Tremaind l|1999l) 
slope is shallower, but only shown out to 3AU. 

The distribution of semi-major axes a makes no sig- 
nificant difference to the conclusions, as the comets are 
all very close to being on parabolic orbits. I chose to 
randomly class 40% of comets as dynamically new and 
given them all a =20,000, while the remainder were given 
a value of a randomly and uniformly distributed between 
1000 and 20,000 AU. 

The time of perihelion passage, orbital inclination and 
perihelion direction were randomly chosen to give a 
uniform distribution on the celestial sphere. This ig- 
nores possible great-circle alignme nts llHorner fe Evans 
20021) and galactic tidal effe cts l|Matese fc Whitmire 
1993: IMatese fc Lissa,ueHl200l . 

3.2. Absolute Magnitudes 

lEverhartl l)1967b|) found that the absolute magnitude 
distribution of comets was be st fit by a bro ken power- 
law, with the break at H ~ 6. Hughga l)2001l) also found 
a break at about the same absolute magnitude, but was 
unable to decide whether it was a real break or simply 
the effect of increasingly incomplete samples at fainter 
magnitudes. To bracket the possibilities, I use a broken 
power-law of the form. 

dn / b("- H »\ H <H b f . 

dH « \ f(H-H b)) H>Hb , {6) 

where ff& is the break magnitude, b is the bright end slope 
and / is the faint end slope. Everhart gives Hi, = 6, 
b = 3.65 and / = 1.82. Hughes gives Hb = 6.5 and 
b = 2.2. If I assume that his observed break is real and 
not an artefact of sample incompleteness, his plots imply 
a faint end slope of / = 1.07, which I adopt to bracket 
the possibilities. This version of the Hughes formulation 
thus predicts dramatically fewer faint comets, as would 
be expected as these are the ones for which Everhart 
applies the largest incompleteness correction fraction. 

3.3. Comet Flux 

lEverhartl (|1967b|) and lHughesI (2001) give different esti- 
mates of the long-period comet flux through the inner so- 
lar system. Everhart estimates a flux of 8000 comets with 
H < 10.9 and q < 4 over 127 years. Hughes estimates a 
flux of 0.53 comets per year brighter than H = 6.5 per 
unit perihelion. I ran the simulations using both. 

4. MONTE-CARLO SIMULATION 

For a given model comet population, the aim is to sim- 
ulate the observable properties of a sample that matches 
the selection effects of the LINEAR sample. The simu- 
lation starts off by generating a set of comets that reach 
perihelion within a three year period. The comets are 
randomly generated using the model distributions in the 
previous section. The model extends down to H = 19 
and out to q = 15. 

I generated two model populations: one using the Ever- 
hart absolute magnitude distribution and flux, the other 
using the Hughes absolute magnitude distribution and 
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Fig. 3. — The perihelia and semi-major axes of all comets in the LINEAR sample with class 1A or IB orbit determinations. The 
distribution remains largely unchanged if comets with less well determined orbits are included. 
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flux. In the Everhart model 260,000 comets are gener- 
ated, while only 4,000 are needed in the Hughes model. 

The position of each comet is then calculated at 24 
hour intervals throughout the three year period, and its 
heliocentric distance r, distance from the Earth A, ap- 
parent celestial coordinates and apparent angular veloc- 
ity written to file. Pure elliptical orbits are used: no 
attempt is made to allow for planetary perturbations. 

At each location, the apparent total magnitude is 
then calculated. Comets are notoriously variable in how 
rapidly their apparent magnitude varies as a function of 
heliocentric distance. I parameterize this, as is conven- 
tional, using Equation ^ Two values of n are randomly 
assigned to each comet: one for before perihelion and 
another for after. For the 40% of our simulated comets 
that I set as dynamically new, the prc-pcrihclion value 
of n is chosen from a Gaussian distribution of mean 2.44 
and standard deviation 0.3. Post-perihelion, the mean is 
3.35 with a scatter of 0.27. For the remaining comets, the 
pre-perihelion numbers are 5.0 with a scatter of 0.8, and 
after perihelion 3.5 with a scatter of 0.5. All these val- 
ues are taken from iWhippld l)1978|) . At large distances 
from the Sun, cometary activity will presumably stop, 
and a bare nucleus will have n = 2. The near- ubiquitous 
detection of fuzz around the LINEAR comets implies, 
however, that this only happens further from the Sun 
than our models reach. 

This approach can only be a rough approximation to 
the real radial brightness dependence. The value of n 
for an individual comet is typically time dependent, and 
all the tabulated values are for comets within ~ 3 AU 
of the Sun, whereas our simulation tracks them out be- 
yond 10AU. In addition, comets show occasional flares 
above and beyond this power-law behavior, which I have 
not attempted to model. Such flares might introduce an 
amplification bias, with comets being pushed over the 
detection threshold. As we will see, however, the slope 
of the absolute magnitude distribution is so gentle that 
this is unlikely to be a major effect. 

4.1. Converting total magnitudes to point-source 
equivalent magnitudes 

The apparent total magnitude of each simulated comet 
can now be calculated at any given point in its orbit. 
Unfortunately, in any CCD-based survey, it is the peak 
surface brightness of the coma that determines whether 
something has been seen, not the total magnitude. The 
LINEAR skymaps, furthermore, list only the magnitude 
limit for a point source at any given location on the sky 
(typically around 19). 

As discussed in the introduction, total cometary mag- 
nitudes are notoriously unreliable. Quantitative studies 
prefer more reproducible and physically meani ngful pa- 
rameters such as Afp (eg. lA'Hearn et a □Hill. Unfor- 
tunately, not enough long period comets have been stud- 
ied in this way to derive the Afp distribution. We are 
therefore forced to attempt some conversion between to- 
tal magnitudes and point-source equivalent magnitudes. 

For bright and near-by comet s, this correction can b e 
as large as ~ 5 magnitudes (eg. iFernandez etal] [l999'). 
The comets in the LINEAR sample were, however, typi- 
cally first seen when very faint (FigHJ) , and were generally 
mistaken for point sources in the initial observation. We 
might therefore expect the correction factor to be much 



smaller, at least when close to the detection threshold. 

The histogram of detection magnitudes (FigEJ climbs 
steeply down to H^is ~ 19, and then falls off fast (the one 
comet discovered when fainter than 20th mag was found 
by Spacewatch, which has a fainter magnitude limit). 
This fall-off occurs at almost exactly the same magni- 
tudes as LINEAR's point source limit, which ranged from 
around 18 to 20. I thus conclude that near the LIN- 
EAR detection threshold, total magnitudes and point 
source equivalent magnitudes are similar. When generat- 
ing mock samples, it is only the magnitude near the de- 
tection threshold that determines whether or not a given 
model comet is included in the mock catalog. 

The exact value of this correction value was set iter- 
atively. I initially guess that the point-source equiva- 
lent (PSE) magnitude and total magnitude (TM) are 
the same, and run the simulations of the comet sam- 
ple. I use the model that best fits the data (§ I5.3fl 
to calculate the predicted discovery magnitude distribu- 
tion, and compare this to the observed distribution. I 
then tweak the PSE— TM correction to bring the his- 
tograms into agreement. The best match is obtained 
when PSE-TM= 0.5 ± 0.5 (Fig®. I use a value of 0.5 
throughout this paper, except where otherwise noted. 

The predicted magnitude is corrected for the effects 
of trailing. LINEAR exposure times vary from 3 to 12 
sec: the latter was used in the correction as it minimised 
the predicted number of very faint comets. 2" seeing 
(FWHM) was assumed. Trailing makes very little differ- 
ence, except for the very faintest comets. LINEAR uses 
unfiltcred CCD magnitudes while the historical surveys 
use unfiltered visual magnitudes. These will be some- 
what different, due to the different wavelength sensitiv- 
ity of the human eye and of the LINEAR CCDs, but the 
discrepancy should only be a few tenths of a magnitude 
at most, and hence is not a dominant source of error. 

Anoher possible worry: the absolute magnitudes I 
quote for the comets in the LINEAR sample (Table ^) 
are derived from total magnitudes measured when the 
comets were barely resolved and far from the Sun, using 
a model for the heliocentric brightness variation. The ab- 
solute magnitudes fit by Everhart and Hughes are based 
on observations of highly extended comets observed close 
to the Sun and Earth. These are thus very different 
quantities, and might well be systematically different, if 
there is some error in our heliocentric brightness correc- 
tion, if total magnitudes for barely resolved comets are 
systematicaly different from total magnitudes for greatly 
extended comets, or if there is some systematic bias in 
the discovery magnitudes reported to the central bureau 
of astronomical telegrams. 

To test this, I picked out the five comets in the LIN- 
EAR sample which were discovered when far (more than 
3.5 AU) from the Sun, but which subsequently passed 
close enough to the Earth and Sun for traditional small 
telescope visual magnitude estimates (within 2AU). Dan 
Green kindly provided me with compilations of visual 
magnitude estimates of these comets while they were 
close to the Earth and Sun, taken from the archives of 
the International Comet Quarterly 9 . These visual/small 
telescope magnitude estimates should be broadly com- 
parable to the data on which the Everhart and Hughes 

9 http://cfa- www . harvard . edu/i cq/ icq . html 
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Fig. 6. — The predicted (dotted) and observed (solid) distributions of discovery magnitudes. 



papers were based. I then compared the predicted mag- 
nitudes when close to the sun (based on the discovery 
magnitude and the model in this paper) with the tabu- 
lated observations. There was a considerable scatter in 
the measured visual magnitudes for each comet: I sim- 
ply averaged all visual small telescope magnitudes made 
when the comet was as close as possible to 1AU from 
both Sun and Earth. 

My predicted magnitudes were consistent with the ob- 
served values, albeit with a large scatter. The mean 
difference (predicted magnitude minus observed magni- 
tude) was 0.4 ± 0.7, where the error indicates the la 
dispersion of the mean. This is not, alas, a strong con- 
straint, but does indicate that the two magnitude scales 
are not grossly discrepant. 

4.2. LINEAR 's Sky Coverage 

The final step is to determine whether LINEAR im- 
aged a part of the sky in which the comet was detectable 
and within its magnitude limit. The published LINEAR 
skymaps show that during each dark period in 2000-2002, 
they attempted to survey the region whose midnight hour 
angle is in the range — 7 < HA < 7, and in the declina- 
tion range — 30 < S < +80°. In winter months with good 
weather, they surveyed more than 90% of this whole re- 
gion down to a point-source magnitude limit of better 
than 19. In bad months, this dropped to a magnitude 
limit of around 18.5 over 60% of this region, and occa- 
sionally worse. In the mean month, 72% of this region 
was surveyed to a visual magnitude limit of 18.5 or bet- 
ter. The exact pattern surveyed was complex and vari- 
able: the only constant was that the densest regions of 
the galactic plane were avoided. Each field was imaged 



five times in succession, with 3-12 sec per exposure, 
once in every dark period. 

This sky coverage was approximated as follows. Each 
comet that enters the -7 < HA < 7, -30 < <5 < +80° 
region at any point, with a point-source equivalent (PSE) 
magnitude brighter than 19 is considered to have been 
potentially observable, unless it was within ten degrees 
of the galactic plane. If a comet is predicted to be de- 
tectable for an entire lunation, it is given a 80% chance 
of having been detected during that lunation. If it was 
predicted to be visible for less than the whole lunation, 
it is given a probability of having been detected equal 
to 80% of the fraction of the lunation for which is was 
potentially observable. 

Does this approximation match the real, more complex 
selection function? This was tested by manually checking 
100 simulated comet ephemerides, containing monthly 
positions and magnitudes, against the real LINEAR sky- 
maps. The approximation was found to give a number 
and absolute magnitude distribution of detected comets 
indistinguishable from the manually checked sample. 

This approach should slightly overestimate the prob- 
ability of a comet being observed, as comets could be 
blended with star or galaxy images. Experience suggests 
that at this relatively bright magnitude limit, this is only 
a few percent effect at worst, at least away from the 
denser regions of the galactic plane, which the survey 
did not cover. 

Another possible source of error is sky subtraction. It 
is unclear exactly how the LINEAR survey do their sky 
subtraction, but if some of the extended coma emission 
is included in the sky value, this will artificially suppress 
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the point source equivalent magnitude. The worst case 
sky subtraction algorithm would be to measure the sky 
brightness from an annulus close to the comet nucleus. 
If I assume that the sky is measured only 5" from the 
nucleus (unlikely), we can used the obs erved 1/r surface 
brigh tness profiles of cometary comae Ipewitt fc Meechl 
11987(1 to show that this sky subtraction algorithm would 
reduce the measured comet brightness by ~ 0.2 mag. 
More plausible sky subtraction schemes would reduce it 
by less, or not at all. Thus this too is not a dominant 
source of error. 

5. COMPARISON WITH THE MODELS 

In this section, I compare the data to the Monte-Carlo 
simulations of what LINEAR would have seen over a 
three year period. In the scatter plots, the data are 
compared to a single run of the simulation. In all his- 
tograms and quoted statistics, however, the data are 
compared against the average or sum (as appropriate) 
of five Monte-Carlo runs based on the same comet popu- 
lation model. This summation should suppress the error 
due to small number statistics in the simulated samples 
to well below that of the observed sample. 

5.1. Everhart 

The observed sample properties are first compared 
against the Monte-Carlo prediction using the lEverharil 
<|1967b[) flux normalization and absolute magnitude dis- 
tribution. 

Figs 0|H1 and El compare the distribution of model and 
observed comets in perihelion q and absolute magnitude 
H . The upper boundary to the locus of points is set 
by the magnitude limit, and seems a reasonable fit to 
the data. But the model clearly predicts far too many 
comets: 2228 as compared to the 52 observed. 

The discrepancy is primarily at fainter absolute mag- 
nitudes: brighter than H ~ 7 the model and data are 
consistent. The worst discrepancy is for comets fainter 
than H ~ 11: ie. fainter than the data on which Ever- 
hart based his model. It is thus a test of the power-law 
extrapolation. The model predicts that LINEAR should 
have seen 1848 comets fainter than H = 11, whereas only 
12 were seen. Irrespective of the flux normalization, the 
shape of the absolute magnitude distribution (FigEJ is 
wrong: a KS-test comparison with the observed distri- 
bution shows that they are inconsistent with > 99.99% 
confidence. 

Is this discrepancy real, or is there some reason why 
LINEAR would miss faint comets close to the Earth? 
The model comets with H > 11 are predicted to be ob- 
servable for a median period of 52 days, at a median 
distance from the Earth of A = 1.0 AU. Their typical ap- 
parent angular velocity is predicted to be ~ 1.1 degrees 
per day. Their observational properties are thus typical 
of near Earth objects, which LINEAR finds in profu- 
sion. It is thus hard to see what selection effect could 
prevent their detection. Grant Stokes (personal com- 
munication) confirms that there is nothing in their data 
analysis which should preclude the discovery of comets 
like these. Image trailing and short observability win- 
dows do reduce the number of these comets seen, but 
these are already taken into account by the Monte-Carlo 
simulation. 

Could the discrepancy be an artifact of the various 



approximations made in the model? The discrepancy 
is insensitive to assumptions about the dependence of 
brightness or comet number on heliocentric distance, as 
these sources are observed at close to 1AU. One possibil- 
ity is that I have incorrectly estimated the effective mag- 
nitude limit of LINEAR for sources with these apparent 
total magnitudes. To test this, I repeated the analysis 
with a magnitude limit set two magnitudes brighter than 
my best estimate. This reduced the discrepancy but did 
not remove it: the prediction dropped to 577 observed 
comets fainter than H = 11, still more than two orders 
of magnitude above the data. None of the plausible in- 
completenesses in the data, nor other assumptions in the 
model can come close to removing this discrepancy. I 
therefore conclude that the Everhart model cannot be ex- 
trapolated to absolute magnitudes fainter than H = 11. 

Even at brighter absolute magnitudes, however, there 
remains a substantial discrepancy. The model predicts 
that 188 comets with H < 11 and q < 4 should have en- 
tered the solar system within the three year period, and 
that 86% of them (162) would have been detected by 
LINEAR. Only 21 such comets were observed. It is hard 
to see that LINEAR could have missed many comets this 
bright passing this close to the Sun. The model predicts 
that these comets should remain visible for a median 208 
days (7 lunations), so almost regardless of position on the 
sky, they should have had several opportunities to be ob- 
served. They spend much of this time many magnitudes 
above the survey detection limit. Indeed, the 21 comets 
observed with these properties were discovered a median 
11 months before perihelion, at a median heliocentric dis- 
tance of 4.3AU, confirming that these are easy targets. 
The discrepancy occurs mostly at the fainter magnitudes 
within this range: brighter than H ~ 7 there is no sig- 
nificant difference between the Everhart predictions and 
the LINEAR observations. 

I conclude, therefore, that the Everhart model fails in 
two ways. Firstly, the quoted normalization of 8,000 
comets per 127 years with H < 10.9 and q < 4 is too 
high by a factor of ~ 7. Secondly, the faint end slope 
of the Everhart absolute magnitude relation is much too 
steep, and immensely over-predicts the number of faint 
comets. This second conc lusion was first reached by 
iSekanina fc Yeomansl 0.986): the current paper indepen- 
dently confirms their result. 

Both discrepancies suggest that Everhart overesti- 
mated the incompleteness of his sample of long-period 
comets. The discrepancy goes away where the incom- 
pleteness correction is small, but is largest at the faint 
magnitudes where the correction is large. A discrepancy 
here is perhaps not surprising, as the correction factors 
calculated by Everhart were so large: he corrected the 
256 observed comets to a flux of 8000: a factor of 31. 
My analysis reduces this correction factor to only ~ 4. 

One possible reason for the difference: Everhart calcu- 
lated the detection threshold for typical historical comet 
searchers, and assumed that the same threshold applied 
when searching for comets initially, and when making 
follow-up observations of known comets. His model was 
validated by noting that the last observations of comets 
occurred close to the time when his model suggested that 
they dropped below detectability. But let us hypothesize 
that comets just above the detection threshold might be 
missed as the telescope speeds past during a scan for 
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Fig. 8. — The perihelia distributions predicted by three different model comet populations, compared to the observed distribution for 
the LINEAR sample. 
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Fig. 9. — The absolute magnitude distributions predicted by three different model comet populations, compared to the observed 
distribution for the LINEAR sample. 
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new comets, even though they could be detected when 
looking hard for an already known comet at a known po- 
sition. This would reduce the length of time over which a 
given comet could have been detected. Detections would 
thus have occurred earlier in the detectability window, 
and the correction factor for incompleteness would thus 
decrease. 

5.2. Hughes Model 

I now compare the data against a Mont e-Carlo simula- 
tion using my version of the lHughesI ((2001) absolute mag- 
nitude distribution and normalization, combined with 
the linearly rising perihelion distribution (Figs 1101) . 
Once again, the overall envelope of points agrees well 
with the model, giving some confidence that the selec- 
tion effects have been modeled correctly. 

The predictions from the Hughes model arc in much 
better agreement with the data. There is no vast excess 
of faint predicted comets, implying that the break seen in 
Hughes' data was real, and hence that the flatter faint- 
end slope of the absolute magnitude distribution is more 
accurate. Neither the perihelion nor absolute magnitude 
distributions, however, are formally consistent with the 
observations at the 99% confidence level, as measured 
by the KS-test. The overall flux normalization is also 
too high: the model predicts that LINEAR should have 
seen 171 comets, rather than the 52 observed. There 
is no significant discrepancy within the region in which 
Hughes quoted his flux normalization (0.54 comets per 
year with H < 6.5 per unit perihelion): the discrepancy 
is at fainter absolute magnitudes and larger perihelia. 
Once again the discrepancy is fairly robust against the 
exact detection limit: dropping the detection threshold 
by a magnitude reduces the predicted comet numbers to 
148 - still too high. 

How can this discrepancy be addressed? Possible in- 
completenesses in the LINEAR sample were discussed in 
§ 12. H and they can at best increase the observed numbers 
by ~ 20%. The Hughes flux normalization is unlikely to 
be too low, as it was based on observed counts of very 
bright comets and made no correction for incomplete- 
ness. I therefore tried to improve the match by tinker- 
ing with the absolute magnitude and perihelion distribu- 
tions. 

5.3. The Best-Fit Model 

I first tried reducing the faint end slope of the abso- 
lute magnitude relation. If we assume that the LINEAR 
sample is 20% incomplete, we need to reduce the faint 
end slope from 1.07 down to 0.8 to bring the number of 
predicted comets down to the observed number. Unfor- 
tunately, this changes the observed absolute magnitude 
distribution too much: a KS test shows that a model with 
this slope predicts an observed H distribution inconsis- 
tent with the data with greater than 99.99% confidence. 

I then tried changing the perihelion distribution. De- 
creasing the slope A in Eqn to zero brought the pre- 
dicted number of comets down to 95, but the perihelion 
distribution is now too skewed towards small values of q 
(99.96% confidence). 

I next tried combining both approaches. Decreasing 
the faint end slope to 1.0, combined with a flat peri- 
helion distribution, brought the predicted numbers into 
line with the observed numbers. Both the perihelion 



and absolute magnitude distributions were individually 
marginally acceptable (KS-test gave 8 and 5% probabil- 
ities of them coming from the same population as the 
data) but the joint probability was still uncomfortably 
low (though the magnitude limit cut means that the 
two distributions are not independent, so this should not 
be taken too seriously). The model predicted too many 
comets with q < 1 and q > 5, and too few in the middle. 

Moving the location of the break in the absolute mag- 
nitude relation to fainter magnitudes was also a failure: 
given that the comet flux normalisation is at brighter 
magnitudes, this simply increased the number of fainter 
comets still further above the observations. 

I therefore adopted a different perihelion distribution: 
one that rises from q = out to q = 2, and is a power-law 
beyond that. This preserves the Ever hart observation of 
a drop in comet numbers below q = 1, while allowing us 
to tinker with the distribution further out. The param- 
eterization used was: 

f 1 + y/q, Q < 2AU . } 

dq^X 2.41 x (q/2)i, q > 2AU W 

where 7 controls the behavior at large perihelia. 

I then ran a grid of models, varying 7 and the faint end 
slope of the absolute magnitude distribution (/ in Equa- 
tion El • Each simulated population was tested against 
the data in three ways: a KS-test on the perihelion dis- 
tribution, a KS-test on the absolute magnitude distri- 
bution, and a Chi squared test on the overall predicted 
number of comets. The latter was done for both the 
observed number of comets and a number 10% higher, 
to allow for possible incompletenesses. The lowest of 
these three significance values was used to compute the 
goodness-of-fit conutours in Fig ^2 Note that these con- 
tours include only random errors: the systematic errors 
are almost certainly larger, especially on 7. The lowest 
probability rather than the joint probability was used 
becuase the three tests are not strictly independent. 

Quite a tight constraint could be placed on /: / = 
1.03 ± 0.09 (95% confidence, not including systematic 
errors). The constraint on 7 was weaker: 7 = — 0.27±0.3. 
No useful constraint could be plac ed on the brig ht-end 
slope b: in the modeling I use the iHughesI |2001) value 
of 2.2, but it makes little difference. Note that these 
slopes are often described in the literature using the a 
parameter: a = log 10 (slope), so our best-fit faint end 
slope has a — 0.004. 

The predicted distribution of comets for the best fit 
model is shown in Figs |H1 El and ^] Models generated 
using this model predict 55 ± 3 (Ict) observed comets, in 
excellent agreement with the data. 

This model was used to predict the discovery magni- 
tude distribution (Fig EJ) . This comparison was used to 
set the equivalent point source vs. total magnitude offset, 
as described in S I4.1I 

My preferred model thus approximately preserves the 
faint-end slope and normalization derived by Hughes. A 
flat perihelion distribution is marginally ruled out, and 
best fits are obtained for one that rises from the Sun out 
to q ~ 2 and is either flat or gently falling beyond that. 

6. DISCUSSION 
6.1. The Comet Flux 
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Fig. 10. — The perihelia and absolute magnitudes of model comets (crosses) and the LINEAR sample (triangles). The model comet 
population was generated using the Hughes flux and absolute magnitude distribution. 




Fig. 11. — Goodness-of-fit contours as a function of model parameters 7 and /. The solid line is the 95% confidence contour, and the 
dotted line is the 99% contour. 
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Fig. 12. — The perihelia and absolute magnitudes of model comets (crosses) and the LINEAR sample (triangles). The model comet 
population was generated using our best fit model of the comet population. 
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The best-fit model can be used to estimate the flux 
of long-period comets through t he inner solar system. 
By definition, this model uses the lHughesI J2001 flux of 
0.53 comets per year per unit perihelion, brighter than 
H = 6.5. 

lEverhartl l)1967bfl estimated a flux of 8000 long-period 
comets per 127 years with H < 10.9 and q < 4. My 
model suggests that the flux of comets with these pa- 
rameters is much lower: 11 long-period comets per year 
(1600 per 127 years). The model suggests that LINEAR 
is picking up over 60% of comets with these parameters. 

I estimate a true flux of 37 comets/year with H < 15 
and q < 8, of which LINEAR is detecting ~ 40%. 

6.2. The Number of Oort Cloud Comets 

Many published esti mates of the numb er of comets in 
the Oort cloud use the lEverhartl (^67b) comet flux as 
their starting poin t, and hence should be revised down . 



ev fc Stagd 1T9 88L Hcislerl Ip90j) . 



Pone s et alJ (12004). 



Estimates i nclud e IB ai 
Wcis smanl l)1996|) and 

The LINEAR sample includes 22 comets with H < 
10.9 and q < 4 over a three year period. 5 of these were 
dynamically new, 11 dynamically old, and 6 had orbit 
determinations too poor to tell. If we assume that the 
comets without good orbit determinations break up be- 
tween new and old in the same ratio as the other comets, 
we find a flux of 7 dynamically new comets over the three 
years. My model suggests that the LINEAR sample is 
~ 80% complete for finding comets in this range, imply- 
ing a total flux of dynamically new comets of ~ 3 per 
year. This corresponds to ~ 0.8 per unit perihelion per 
year if a uniform perihelion distribution is assumed . This 
is a fa ctor of ~ 2 lowe r than was assumed by iHeislerl 
fL990h and IDones et alJ pOO^ . iBailev fe Staed (|1988f) 
and IWeissmanl l)1996J ) assume long-period comet fluxes 
(not just the dynamically new ones) of ~ 10 per year 
per AlJ down to H < 10.9, compared to my value of 
~ 2.5. After correction, all these estimates come out 
roughly the same: 1 - 3 x 10 11 in the outer Oort cloud, 
down to H < 10.9. If this is extended to fainter absolute 
magnitudes, I estimate an outer Oort cloud population 
of ~ 5 x 10 11 comets down to H = 17. 

6.3. The Mass of Oort Cloud Comets 

My model has a much shallower slope of the abs olute 
magnitude distribution than that of lEverhartl l)1967b|) . so 
the average mass of a comet increases. This cancels out 
the decreased number of Oort cloud comets I predict to 
give a similar total Port cloud mass to pr evious estimates 
(eg. IWeissmanl fl996l IDones et all 120041) . both of which 
used a mean mass computed by integrating the Everhart 
curve. 

I used two suggested mass-brightness relations to es- 
timate masses from the observ ed absolute magnitudes: 
one fr om IBailev fc Stagd l)1988|) and one from IWeissmanl 
(1996). Note that this relation is extremely uncertain - 
very few long-period comets have had even their nuclear 
magnitudes measured. 

The average mass is crucially dependent on the slope 
of the bright end of the absolute magnitude distribution, 
which the data in this paper do not constrain. I bracket 
the possibilities by using both the Everhart and Hughes 
values (3.54 and 2.2 respectively). I use my own estimate 
of the faint-end slope. 



For the Everhart bright-end slope, the total mass con- 
verges as you go to brighter magnitudes: the bulk of the 
mass resides in comets with < H < 9. For the Hughes 
bright-end slope, however, the total mass diverges as you 
count brighter comets. The bright comets are rare, but 
their mass goes up faster than their number goes down 
at bright magnitudes. Thus for the Weissman mass re- 
lation, comets with H ~ —5 are 10,000 times more mas- 
sive than those with H ~ 5. The Hughes bright-end 
slope, however, suggests that they are only 2,600 times 
less common, so the total mass in the brighter comets 
is actually four times greater. For the Everhart bright- 
end slope, however, the brighter comets would be 300,000 
times less common. 

The Everhart slope is thus physically more appeal- 
ing, as it avoids the need for a bright cut-off. Given 
the success of both the Hughes normalization and faint- 
end slope in fitting our sample, however, his bright-end 
slope should perhaps be taken seriously, leading to the 
prospect of an Oort cloud dominated (in mass terms) by 
very large comets. In this section, I will cut off the mag- 
nitude range of comets at H = —5, but this is arbitrary 
and will have a large effect on the total Port cloud mass 
if the Hughes distribution is assumed. iBernstein et alJ 
( 2004) showed that Kuiper Belt objects have a break in 
their mass distribution at sizes of ~ 100 km, which cor- 
responds to H ~ —3 fS I6.7J) . This may or may not apply 
to Port cloud comets. 

If I take the Everhart slope, the me an mass of c omets 
down to H = 11 is 1.7 x 10 17 g for the IWeissmanl (1199' 
mass relation, and 5.6 x 10 16 g for the Bai lev fc Stag 
( 1988) relation. If I take the Hughes slope, however, 
the average masses rise to 1.1 x 10 18 g and 1.2 x 10 18 g 
respectively. These values translate into total outer Port 
cloud masses of 2 - 40 Earth masses. 

6.4. Terrestrial Impact Probabilities 

My model can be used to calculate th e probability of a 
long-period comet impacting the Earth. iSteell l)1993(l cal- 
culated that the probability of a given long-period comet 
with q < 1AU impacting the Earth is ~ 3 x 10~ 9 per 
perihelion passage. My model suggests that the flux of 
comets brighter than H = 19 with q < 1AU is 8 per 
year. The mean time between comet collisions with the 
Earth is thus ~ 40 mill ion years: very comparab le to the 
figure calculated bv lSekanina fc Y comans l)1986|) . and to 
the mean time between global extinction events. 

Most of these comets will, however, be quite small. 
Using the conversion betw een absolute magnitude and 
radius described in § 16.71 a comet with H ~ 19 has a 
radius of only ~ 20m: too small to cause a global extinc- 
tion event. Comets with radii of 1km or greater (H < 9) 
are rarer - the mean time between collisions with comets 
this large is ~ 150 million years. 

The shallow faint end slope means that collisions with 
even small long-period comets are rare. If, for exam- 
ple, the Tungusk a impact was caused by a comet (eg. 
iBronshter I2000D . it would have a mass of ~ 10 n g 
l)Vasilvevl 119981) and hence an absolute magnitude of 
~ 18. The probability of such a long-period comet im- 
pacting the Earth in the last 100 years is thus < 10~ D . 
The Tun guska impactor must therefore be ei ther aster- 
oidal feg lSekaninalll998tlFarinella et aLll200lD or assoc i- 
ated with a short-period comet Ceg. lAsher fc Steellll998|) . 



17 



6.5. Detection of Interstellar Comets 

No comet has ever b een detected with a strongly hy- 
perbolic original orbit l)Kresaklll992j) : ie. a comet that 
was not gravitationally bound to the so lar system. Sev- 
eral authors have discussed this ( eg. ISekaninal Il976t 
McGl vnn fc Chanmanl H983 ISternl Il990t ISen fc Ramal 
199.lt) . with many claiming that this is surprising. Oort 
cloud formation models predict that for every comet 
that reaches the classical outer Oort cloud, a factor 
77 = 3 — 100 more are expelled into interstellar space (eg. 
IDuncan. Quinn fc Tremaind 11987]) . If most stars have 
planets, and if planetary formation is usually accompa- 
nied by comet ejection, then there should be a substantial 
population of free-floating interstellar comets. By some 
estimates, we should have expected to have seen one or 
more such comets by now, passing through the inner so- 
lar system. 

The results in this paper impact upon this question 
in two ways. Firstly, the non-detection of interstellar 
comets in the LINEAR sample places an upper limit on 
their space density. Secondly, most previous estimates of 
the expected number of interstellar comets relied upon 
the lEverhartl l)1967b|) comet flux: our lower comet flux 
thus leads to smaller predictions of the interstellar comet 
density. 

What limit can we place upon the space density of in- 
terstellar comets (those with strongly hyperbolic orbits) 
from the non-detection of any by LINEAR? I will assume 
that interstellar comets have the same absolute magni- 
tude distribution that I derive for long period comets, 
and that their apparent brightness varies with heliocen- 
tric distance in the same way as dynamically new in- 
bound comets in our model. Given these assumptions, 
and an assumed magnitude limit of 19 (point source 
equivalent), one can derive the distance r out to which 
a comet with any given absolute magnitude could have 
been detected. To convert this into the volume surveyed 
during the three years of the survey, one must allow for 
the motion of the comets with respect to the solar system, 
which can carry new comets into range. Typical motions 
of nearby stars with respect to the Sun are v <~ 40km s _1 
ijGarcia-Sanchez et alJ l20011 . The volume surveyed in a 
survey of duration T is thus 

V = -tit 3 + Tir 2 Tv. (5) 
3 

The product of this equation and the absolute magnitude 
distribution (Equation[3J| was integrated to calculate the 
number of comets potentially within LINEAR's magni- 
tude limit, for a given assumed space density of interstel- 
lar comets (defined as the number of interstellar comets 
per cubic astronomical unit brighter than H = 11). The 
integral suggests that the bulk of interstellar comets de- 
tected will be those with absolute magnitudes near the 
break at H — 6.5. The results are quite sensitive to 
the adopted bright-end slope of the absolute magnitude 
distribution, being 40% lower for the Everhart slope as 
compared to the Hughes slope. 

Not all of these comets will be seen: my model sug- 
gests that LINEAR finds ~ 70% of the Oort cloud comets 
within its magnitude limit. The fraction may be lower for 
interstellar comets because they move faster and hence 
are not observable for long, but the velocity difference is 
only < 50%. Furthermore, a larger fraction of interstel- 



lar comets will be brght ones seen at large heliocentric 
distances, where the visibility period is larger. I adopt 
a conservative 50% detection probability, which should 
be ample to include comets not being followed up or not 
having good orbit determinations. The mean number of 
comets seen over the three years is then evaluated as a 
function of the assumed average density. If more than 5 
comets are predicted to have been observed, the Poisson 
probability of us having not seen any interstellar comets 
is less than 5%: this is our adopted limit. 

I thus derive an upper limit on the local space den- 
sity of interstellar comets of 6 x 10~ 4 per cubic AU (95% 
confidence) if the Hughes bright-end slope is assumed. 
For the Everhart bright-end slope, this limit increases to 
9 x 10~ 4 per cubic AU. These limits a re very compara - 
ble to the best existing limit: that of ISekaninal (1976). 
Sekanina's limit is, however, based upon Everhart's pa- 
pers and should thus be regarded with some suspicion. 
I can extend this calculation by noting that LINEAR 
has not discovered any interstellar comets in other years. 
From 1999 through to the end of 2004 their monthly sky 
coverage (though not available in detail) is at least com- 
parable to that during my sample period. These extra 
three years of data reduce our upper limits to 3 — 4.5 
xl0~ 4 per cubit AU. 

I now evaluate the expected space density of interstel- 
lar comets, given o ur reduced P ort cloud population es- 
timate. Following ISternl (^90), the number density of 
interstellar comets n- lsm is given by: 

^-ism — V ^stars A^comcts ; (6) 

where n s tars is the local number density of stars, A^ comet s 
is the mean number of outer Oort cloud comets per star, 
and 77 is the ratio of comets expelled from a solar system 
to the number ending up in the outer Oort cloud. 

I adopt rtstars ~ 0-1 p er cu bic parsec, from the 8pc 
sample of iReid fe Crizis]j^)9jj]^_ this is consis tent with 
the value used bv lMcGlvnn fc Ohapmanl ll 19891) but con - 
siderably lower than the value used bv ISternl {l990). 
For A comcts , I assume that the number of comets gen- 
erated per star is proportional to its mass and metal- 
licity. This may or may not be true, but is consis- 
tent with the observed ten dency for low mass stars to 
lack hot Jupiter planets (eg. lGrether fc Lineweaverl2005L 
and refs therein). The local stellar population is domi- 
nated by low mass dwa rfs: the average stell ar mass of 
the nearby stars in the IReid fc Gizid lfl9 97') catalogue 
is only ~ 0.3M Q . The average meta llicity of near-by 
F an d G stars is [Fe/H] ~ -0.14 l|Nordstrom et all 
2004): no information is available for local dwarf stars, 
so I assume the same value. Assuming that the outer 
Oort cloud population derived in § 16.21 is typical of lo- 
cal stars with solar mass and metallicity, I therefore 
derive A comcts = 0.3 x 0.72 x 2 x 10 11 = 4.3 x 10 10 . 
The most uncertain para meter is y: literature values 
range from 3 - 100 (eg. IDuncan. Quinn fc Tremaine] 
[T987 [lMcGlvnn fc ChaDmaJl98ftlSterJl99(trWeissmai1 
1996). 

Given these parameters, 1.4 x 10~ 6 < n iBm < 4.7 x 
10~ 5 . Thus even in the most optimistic case, the pre- 
dicted flux is an order of magnitude below current limits. 

Could future surveys reach these predicted densities? 
I ran the simulation for five year surveys reaching to 
deeper magnitude limits. A survey reaching 24th magni- 
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tude (perhaps PanStarrs) would place 95% upper limits 
of ~ 5 x 10~ 5 , and might hence detect o n e or t wo inter- 
stellar comets if r\ is very large. Uewitti l)2003|) reached 
similar conclusions. A survey reaching 26th magnitude 
(LSST?) would push the limit down to - 1.5 x 1(T 5 , 
which would be enough to detect or rule out a large value 
of 7]. This prediction does, however, depend crucially on 
the assumed brightness behavior of comets a long way 
from the Sun. 

6.6. Perihelion Distribution 

Simple models which assume that Oort cloud comets 
have an isotropic velocity dispersion imply a constant 
number of comets per unit perihelion, n-body integra- 
tions feg. lTsuiilll992HWiegert fc Tremainelll999ft . in con- 
trast, imply a rising number of comets at larger perihe- 
lia, due to the diffusion process of co mets p a st the per- 
turbations of the giant pla nets. Both iTsuiH l)1992j) and 
iWiegert fc Tremainel (pj>99), for example, predict an in- 
crease in the number of comets per unit perihelion of 
~ x2 between q = 2 and q = 8 (though this is an ex- 
trapolation of the Wiegert & Tremaine model which is 
only shown out to q = 3). Is this consistent with the 
LINEAR sample? 

The best-fit model has the number of comets per unit 
perihelion (beyond q — 2) going as a power law of index 
7 = -0.27 ± 0.3 (95% confidence), ie. a g entle fall. To 
get a rise in numbers co n sisten t with the iTsuiil l)1992ft 
and IWiegert fc Tremainel l)1999t) predictions, we require 
7 ~ 0.5, which is inconsistent with our model with 99% 
confidence. 

The measured perihelion distribution is, however, 
somewhat degenerate with the assumed dependence of 
comet brightness on heliocentric distance: n in equa- 
tion ^ As I have repeatedly noted, the choice of n is an 
approximation based on extrapolations of observations 
obtained at much smaller heliocentric distances. 

We can estimate the change in n that wo uld be needed 
to bring our observations into line with the ITsuiil l)1992f) 
predictions. We need to drop the predicted brightnesses 
of comets with q ~ 8 by enough to reduce the observed 
numbers by 50%, to meet our 95% upper limit. Given our 
best-fit absolute magnitude relation, this requires that n 
be increased by An > 0.8. 

This is quite a small rise - well within the observed 
scatter of n values seen at lower perihelia. If, for exam- 
ple, I had used the canonical value of n — 4 for all comets, 
rather than our more complex scheme, this would make 
dynamically new comets much fainter when far from the 
Sun, as required (at the expense of the correlation seen in 
FigEJ- The data in this paper, while suggestive, are not 
therefore significantly at odds with the theoretical predic- 
tions. A better understanding of heliocentric brightness 
variations when distant from the Sun will be needed to 
see if this anomaly is real. 

6.7. Implications of the Shallow Faint-End Slope 

The faint-end slope of the absolute magnitude relation 
derived in this paper (1.01±0.11) is very flat: the number 
of comets per unit magnitude barely increases as you go 
fainter. This presumably indicates that small comets are 
not that much more abundant than large ones. If the ab- 
solute magnitudes are converted to nuclear masses, using 



either the lBailev fc Stagd l)1988j) or lWeissmar] l)1996[) re- 
lations, I find that the differential number of comets per 
unit mass m goes as: 

^ocm- L04±al . (7) 
dm 

Bra ndt et alJ l|1996j) made the case for the existence 
of small comets: those with nuclei o nly meters to tens 
of meters in radius. If we assume the IWeissmanl i|1996|) 
relationship between mass and absolute magnitude, and 
a density of 0.6g cm" 3 , 100m radius corresponds to H ~ 
15 and 10m to H ~ 20. LINEAR is therefore detecting at 
least a few comets with nuclei smaller than 100m. Simi- 
lar small comets are also detected by the LASCO instru- 
ment on the SOHO spacecraft l)Biesecker et alJ [2002') . 
though these are mo stly fragments of recently disinte- 
grated larger comets l)Sekanina fc Chod"asll2004D . 

IHughesI i200l was unclear on whether the shal- 
low faint-end slope was real or a selection effect: I 
confirm that it is real. ISekanina fc Yeomansl (|1986f) 
pointed out that small long-period comets must be 
rare f rom statistics of comets passing close to the 
Earth. iMeech. Hainaut fc Marsdenl <|2004l) found a simi- 
lar paucity of small Jupiter-family comets. 

In this section, I point out that the shallow- 
ness of the faint-end slope is actually quite in- 
teresting th eoretically. Collisions are rare in the 
Oort cloud l|Sternl Il988j) . so the nuclear size distri- 
bution should remain largely unchanged from when 
the proto-comets we re planetesimals expelled from the 
protoplanetary disk |D uncan 1 Quinn fc Tremainel 119871 
iGoldreich. Lithwick fc Sarill2004l) . 

iWeid cnschilling ( 1997j) modeled the size distribution 
of planetesimals in the protoplanetary disk. These mod- 
els imply that 100m diameter objects should outnum- 
ber 2km sized objects by a factor of ~ 10 10 (per unit 
log radius) in the oldest disks model ed. Our fa i nt-end 
slope, however, if combined with the IWeissmanl l|1996f) 
mass/absolute magnitude relation, implies a ratio of only 
~ 10 4 . 

The size distribution of planetesimals may be greatly 
modified by collisions while they lie within the dense 
environment of the proto-planetary disk. Most models 
of this collisional evolution also, however, predict much 
flatter size distributions than we see: they typically pre- 
dict an increase in comet numbers, even at faint magni- 
tudes, of a factor of ~ 3.2 per magnitu de ( Dohnanyi|1969j; 
IWetherill fc Ste^FmM IHharnoz fc Morbid el lil 12003ft . 
This is much greater than my measurement (1.03±0.09). 

As noted in § 15.11 our shallow faint-end slope is quite 
robust to sample incompletenesses and model assump- 
tions. The conversion of absolute magnitudes to radii 
is, however, highly uncertain even at bright magnitudes, 
let alone the faint absolute magnitudes of releva nce here. 
Theoretical predictions are also quite uncertain l)Lissauerl 
1993; lYoudin fc Shvj l2"002). and may be consistent with 
this slope. Another possibility, however, is that the 
probability of a planetesimal escaping the protoplane- 
tary disk and reach i ng th e Oort cloud is size-dependent. 
iStern fc Weissmanl 1)200 lj) . for example, suggested that 
collisions between planetesimals would act to circularize 
their orbits, and would prevent them from escaping into 
the Oort cloud until the density of planetesimals wa s 
greatly depleted (see also iCharnoz" fcMorbidellil [20031. 



19 



The square-cube law would suggest that this effect is 
most serious for smaller planetesimals, which may be 
ground down to dust before escaping. This could thus ex- 
plain the deficit of small comets. Alternatively, gas drag 
could play the same role: the largely gaseous nature of 
the giant planets indicates that the protoplanetary disk 
was still full of gas when the giant planets formed. No 
published modelling currently includes this effect. 

Another possibility: the square-cube law implies that 
small comets may loose their volatiles faster than large 
ones. Fading may thus be more severe for these comets, 
making them harder to detect. A final possibility is that 
comets have been exposed to high temperatures at some 
point in their history. The resultant loss of volatiles 
could thus destroy small co mets without much affecting 
the nu mbers of large ones. IMeech. Hainaut fc Ma rsden 
( 2004) tentatively suggest this as a reason for the lack 
of Jupiter-family comets with small nuclei. Long period 
comets might have been exposed to high temperatures 
while still in the proto-planetary disk. Once in the Oort 
cloud, temperatures are much lower. Nearby supernovae 
and O-stars may have te mporarily hea ted them enough 
to remove some volatiles l)Sternl 1200^ . but this should 
not appreciably affect the numbers of small comets. 

I note in passing that my models are strongly i ncon- 
sistent with the claims by Frank & Sigwarth (2001J) for 
an immense population of small comets bombarding the 
Earth. These claims have, how ever, already been ruled 
out in many ot her ways (eg. iBashar fc Dess lerj Il997t 
IHannetran et afflT993lKnowles et ablll 999HHa,rnsil200m. 

6.8. Implications for Future Surveys 

The model in this paper can be used to guide future 
automated comet-searches. The most basic conclusion 
follows from the shallow faint-end slope of the num- 
ber/absolute magnitude relation. This means that to 
find more comets, a survey should always try to maxi- 
mize the area covered rather than going deep in a small 
area. If LINEAR, for example, exposed for six times as 
long per field, it would be sensitive to fainter comets, and 
would hence see ~ 1.5 times more comets per unit area. 
But it would cover a six times smaller area. 

The more frequently a survey covers a given area of 
sky, the higher the probability of a given comet being 
detected. In Fig El however, I show that this is not an 
enormous effect: decreasing the survey frequency for a 
hypothetical survey from weekly to biannual only drops 
the number of detected comets by ~ 20%. The comets 
lost are primarily those with fainter absolute magnitudes, 
because their visibility period is small. The small number 
lost is a direct consequence of the shallow faint-end slope: 
there are few small comets to lose. 

In Fig El I show the predicted samples that would be 
found by telescopes using a similar survey technique to 
LINEAR, but going deeper. The deeper limit essentially 
increases the absolute magnitude limit reached at all per- 
ihelia, and by increasing the time over which comets 
are visible, also improves the completeness for brighter 
comets. Over 3 years, a survey to 20th mag would de- 
tect 103 comets, to 22nd mag would detect 150 and to 
24th m ag, 186 comets. This is much fewer than Uewittl 
(2003! estimated. The discrepancy is probably due to 
my shallow inferred absolute magnitude and perihelion 
distributions, which mean that the dramatic increase in 



sensitivity of these surveys as compared to LINEAR only 
yields a relatively small increase in sample size. 

I conclude that telescopes such as SkyMapper, 
PanStarrs and LSST, each capable of surveying large ar- 
eas to deeper than 22nd magnitude, should be capable 
of detecting more than 50 long-period comets per year. 
A substantial fraction of these are forecast to have per- 
ihelia beyond 10AU, though this conclusion relies upon 
the rather shaky assumptions of how brightness varies 
with heliocentric distance this far out. Thus five years 
should suffice to build up a quantitatively selected sam- 
ple of long-period comets equal in size to any historical 
sample. 

7. CONCLUSIONS 
Six main conclusions can be drawn from this analysis: 

1. The outer Oort cloud contains ~ 5 x 10 11 comets 
down to H = 17 (~ 2 x 10 11 comets down to 
H = 11). This is 2 — 10 times fewer comets than 
previous published estimates. 

2. The average mass of these comets is, however, 
higher than previous estimates. Down to H = 
11, the average mass is between 5.6 x 10 16 g and 
1.2 x 10 18 g, leading to a total mass in the outer 
Oort cloud of 2 — 40 Earth masses, comparable 
to or larger than previous estimates. The mass of 
the Oort cloud may be dominated by a few large 
comets. 

3. Small comets do exist, but are rarer than predicted 
by many models. This may be because they have 
difficulty escaping from the protoplanetary disk. 
The probability of the Earth being hit by a long- 
period comet similar in energy to the Tunguska im- 
pactor is only one in forty million per year. 

4. I place an upper limit on the space density of in- 
terstellar comets of 4.5 x 10~ 4 per cubic AU (95% 
confidence). This is still an order of magnitude 
above our revised prediction for the space density 
of interstellar comets. 

5. The number of long-period comets per unit perihe- 
lion seems to decline, or at best rise slowly beyond 
2AU. This does not agree with theoretical predic- 
tions. The discrepancy may be resolved if comets 
are fainter at large heliocentric distances than I as- 
sume. 

6. Future survey telescopes should be able to assemble 
samples of several hundred long-period comets in a 
few years of operation. 

The major weakness in this analysis is in the photom- 
etry: in particular in our limited understanding of how 
the brightness of comets varies with heliocentric distance 
when far from the Sun, and in how to convert ill-defined 
total magnitudes into more reproducible and physically 
meaningful parameters such as Afp. The data exist to 
address these problems, but are not publicly available. 

I would like to thank Grant Kennedy, Eriita Jones and 
Chris Weekes for their work on aspects of this paper, 
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Fig. 13. — The predicted long-period comet sample found by a hypothetical survey regularly covering the whole accessible sky down 
to a point-source magnitude limit of 22. Open triangles are the comets that would be found if this region were surveyed once per week. 
Solid triangles are those that would also have been seen if the region were only surveyed twice a year. The sample was obtained for a 
hypothetical three year survey and only includes comets reaching perihelion within these three years. No weather losses were included. 
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Fig. 14. — The predicted long-period comet sample found by a hypothetical survey regularly covering the whole accessible sky once 
per month. Open triangles are the comets that would be found if survey reached a point-source magnitude limit of 24. Solid triangles are 
those that would also have been seen if limit were only 20. The sample was obtained for a hypothetical three year survey and only includes 
comets reaching perihelion within these three years. No weather losses were included. 
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